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(54) Title: SYNTHETIC GENE 



(57) Abstract 

Synthetic DNA coding for horseradish 
peroxidase includes the sequence (I), and in- 
corporates useful restriction sites at frequent in- 
tervals to facilitate the cassette mutagenesis of 
selected regions. Also included are flanking 
restriction sites to simplify the incorporation of 
the gene into any desired expression system. 
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SYNTHETIC GENE 

This invention relates to synthetic genes coding for 
horseradish peroxidase. 

Horseradish peroxidase C (E. C. 1. 11. 1. 7) (HRP) is the 
major peroxidase isozyme isolated from the horseradish 
( Armoracia rusticana ) . It is a monomeric glycoprotein 
of 308 amino acids the polypeptide chain having a MW of 
33,980 D. There are three neutral carbohydrate side 
chains and 4 disulphide bridges. The amino acid 
sequence of the mature protein has been determined. 
The presence of a pyrrolidonecarboxylyl amino terminus 
indicates that the protein is probably produced as a 
precursor form that is processed on secretion. The 
active form of the enzyme contains a hemin prosthetic 
group . 

The enzyme is particularly stable and is amenable to 
crosslinking and derivitisation without excessive loss 
of activity. This together with its wide range of 
chromogenic substrates, some of which give rise to 
insoluble, chemi luminescent or flourescent products, 
and the low background activities observed in most 
applications, have made horseradish peroxidase an 
invaluable tool for diagnostic and research 
applications in the fields of immunology, 
histochemistry, cytology and molecular biology. A 
further advantage it presents over other enzymatic 
markers is that some some substrates for the enzyme 
give rise to electron dense products that allow 
correlation of peroxidase location with cellular 
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ultrastructure using electron microscopy. In addition, 
horseradish peroxidase is electron dense itself by 
virtue of the Fe it contains and as a result can act as 
an E.M. marker in its own right. Particular 
applications have been found in immunochemistry, where 
peroxidase cross linked to immunoglobulin is widely 
used in both ELISA based assay systems and 
immunocytochemistry . Methods have been described that 
use either direct crosslinking of peroxidase to the 
immunoglobulin or indirect crosslinking of biotin 
labelled immunoglobulin to a streptavidin/horseradish 
peroxidase complex. Such streptavidin complexes have 
also found widespread application in nucleic acid 
hybridisation methods where biotinylated probe 
sequences can be localised by sequential incubation 
with the streptavidin/peroxidase complex and a suitable 
chromogenic peroxidase substrate. 

The amino acid sequence of horseradish peroxidase is 
taught by Welinder, K.G. ( Eur. J. Biochem. 96, 483-502 
(1979)). The cloning of the cDNA or natural gene for 
horseradish peroxidase has not been described. 

In order to facilitate the dissection of the 
structure/function relationships of HRP, its 
incorporation into expression vectors and the 
production of novel chimeric proteins containing HRP 
functionality an improved novel synthetic gene for the 
peroxidase C produced by Armoracia rusticana is sought. 



It is by no means easy to predict the design of an 
improved HRP gene, since the factors that determine the 
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expressibility of a given DNA sequence are still poorly 
understood. Furthermore, the utility of the gene in 
various applications will be influenced by such 
considerations as codon usage and restriction sites. 
The present invention relates to a synthetic HRP gene 
which has advantages in the ease with which it can be 
modified due to the presence of useful restriction 
sites. 

When synthesising and assembling genes, problems have 
been encountered when there are inverted or direct 
repeats greater than eight bases long in the genetic 
sequence. In addition, areas of unbalanced base 
composition such as G/C or A/T rich regions or 
polypurine/polypyrimidine tracts have been found to 
lead to inefficient expression. The present invention 
seeks to overcome or at least alleviate these 
difficulties. 

According to a first aspect of the invention, there is 
provided DNA coding for HRP and having restriction 
sites for the following enzymes: 

Heal, Sad, Ssp I. Nhel. Nsil, Clal, Pvul, Sphl, 
BspM I , Fsp l, Rsr II . Sai l, Bgl ll, Bal l, Pvu II, 
Xhol, BspMII , BstEII, SnaBI, PflMI , ApaLI , Sfiel, 
Seal, Xbal, StuI, Bell, BstXI , Ncol, Kpnl, and 
Pst l. 

The DNA may also contain a 5' Hindlll site and/or a 5' 
Ndel site and/or a 3' Bam HI site and/or a 3' EcoRl 
site. 
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According to a second aspect of the invention, there is 
provided DNA including the following sequence: 
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The above sequence may be immediately preceeded by an 
initiation codon (ATG) and immediately followed by a 
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termination codon (TAA) , but this will not necessarily 
be the case if the DNA incorporates linker (s) and/or 
*extension(s) , such as a sequence coding for a signal 
peptide, for example for efficient expression in 
eukaryotic cells such as mammalian cells. One extension 
which gives good expression in mammalian cells is a 5'- 
extension coding for the amino acids 
KCSWVTFFLMAWTGVNS , which may be provided between an 
initiation codon and the codon coding for the first Q 
residue. A preferred such extension is shown in Figure 
6. A sequence coding for a 3 ' -signal sequence may code 
for LLHDMVEWDFVSSM; a preferred DNA sequence coding 
for this series of amino acid residues is also shown in 
Figure 6. 

A synthetic HRP gene as described above incorporates 
useful restriction sites at frequent intervals to 
facilitate the cassette mutagenesis of selected 
regions. Also included in preferred embodiments are 
flanking restriction sites to simplify the 
incorporation of the gene into any desired expression 
system. 

Codons are those that are favoured by E. coli but it is 
expected that the DNA would be suitable for expression 
in other organisms including yeast and mammalian cells. 

According to a third aspect of the invention, there is 
provided a genetic construct comprising DNA according 
to the first or second aspect or a fragment thereof. 
The fragment may comprise at least 10, 20, 30, 40 or 50 
nucleotides. A genetic construct in accordance with the 
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third aspect may be a vector, such as a plasmid, cosmid 
or phage. 

According to a fourth aspect of the invention, there is 
provided a process for the preparation of DNA in 
accordance with the first or second aspect or a genetic 
construct in accordance with the third aspect, the 
process comprising coupling successive nucleotides 
and/ or ligating appropriate oligomers. 

The invention also relates to other nucleic acid 
(including RNA) either corresponding to or 
complementary to DNA in accordance with the first or 
second aspects. 

The invention encompasses a process for the production 
of monodisperse horseradish peroxidase C comprising the 
expression' of at least part of a genetic construct as 
described above. 

Further, the invention extends to constructs as 
described above comprising all or a fragment of a 
sequence in accordance with the first or second aspect 
fused to any other sequence of DNA so as to result in a 
sequence capable of encoding a hybrid protein 
possessing peroxidase activity. An example of such a 
construct is a genetic fusion between a gene encoding 
horseradish peroxidase and a gene encoding streptavidin 
or avidin such that the encoded fusion protein 
possesses both biotin binding and peroxidase activity. 
Another example is a genetic fusion between a gene 
encoding horseradish peroxidase and a gene encoding an 
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immunoglobul in-derived antigen binding function such 
that the fusion protein possesses both antigen binding 
and horseradish peroxidase activity. The antigen 
binding function may be an immunoglobulin heavy chain 
or light chain or fragments thereof or an engineered 
monomer ic antigenic recognition site. 

Particular constructs of interest include : vectors 
comprising the gene for horseradish peroxidase C that 
enable the production of fusions between horseradish 
peroxidase and any other protein of interest; and 
expression vectors that provide for the co-expression 
of the gene for horseradish peroxidase and another gene 
of interest either as a single fusion product, as a 
single polycistronic message or as two separate but 
linked transcriptional units. 

According to a further aspect of the invention, there 
is provided a gene for horseradish peroxidase 
containing a mutation (either missense, nonsense, 
deletion, insertion, duplication or other re- 
arrangement) that destroys or impairs the activity of 
the encoded horseradish peroxidase protein. The 
invention extends to genetic constructs including all 
or a fragment of such a mutant horseradish peroxidase 
gene. 

Defective or non-defective horseradish peroxidase 
genetic constructas can be employed (for example as 
markers) in mammalian cells and/or in transgenic 
animals . 
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Specific applications of synthetic genes for 
horseradish peroxidase, which themselves form further 
aspects of the invention , are disclosed in greater 
detail below: 

1) The gene can be incorporated into a suitable 
expression vector to allow for the efficient production 
of the enzyme in a compatible organism. This will have 
the advantage of being a ready source of a monodisperse 
enzyme preparation free of the contaminating isozymes 
present in the material isolated from horseradish root. 
Varying the organism or cell type chosen for production 
will also allow for the production of HHP with 
different patterns of glycosylation, including no 
glycosylation. Such material will have better defined 
properties that will make it more suitable for more 
demanding histochemical applications and sensitive 
enzyme assays , especially immunoassays. 

2) The gene can be incorporated into an HRP- 
streptavidin or HRP-avidin gene fusion. This will 
allow for the production of streptavidin-HRP or avidin- 
HRP complexes without the need for cross-linking. 
Again this will allow for a better defined, more stable 
product and will probably result in less loss of both 
biotin binding and peroxidase activity. 

3) Similarly, fusions between immunoglobulins and HHP 
or protein A and HRP can be produced that would be 
valuable histochemical reagents. Again the need for 
the usual cross-linking procedures would be avoided. 
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4) The HRP gene would have valuable applications in 
the construction of vectors designed to allow the 
production of fusions between HRP and any other protein 
for which a gene or cDNA had been cloned or for which 
the amino acid sequence is known. This would be useful 
both for monitoring the expression of a gene the 
product of which is difficult to assay and to tag the 
protein of interest to allow its metabolism and 
pharmodynamics to be followed in vivo by the use of the 
appropriate histochemical techniques or enzyme assays. 
Additionally, HRP fusions will allow for a simple 
immunopurif ication of the fusion product through the 
use of an appropriate anti-HRP antibody. 

5) The expression of HRP will be a useful marker in 
expression systems, eg mammalian cell expression 
systems. The HRP gene could be expressed either as a 
fusion or on a polycistronic message with the gene of 
interest, or as a separate but closely linked 
transcriptional unit. The production of the easily 
assayed HRP could be readily screened for and used as 
an indication as to which clones of cells were likely 
to be expressing large quantities of the deisred 
product. The use of fluorescent or chemiluminescent 
HRP chromogenic substrates would allow for the 
possibility of directly selecting high producing 
eukaryotic cells by fluorescence activated cell-sorting 
(FACS) . 

6) HRP genes carrying mutations (missense, nonsense, 
deletion, insertion, duplication or other re- 
arrangement) that destroy or impair the enzymatic 
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activity of the resultant product would allow the 
construction of vectors that could be used to follow 
the frequency of reversion or suppression of the 
particular mutation introduced into the gene* 

The introduction of such defective HRP genes into the 
germ line of the organism of interest would also enable 
a researcher to fate-map particular cell-types by 
histologically examining the pattern of HRP activity in 
the tissue of interest. Care would have to be 
excercised in constructing a mutant HRP gene with the 
correct in vivo reversion rate so that areas of HRP 
activity and hence the presence of reverted HRP gene 
could be taken as evidence for the clonal origin of the 
HRP+ cells. The intact synthetic non -mutant gene could 
also be used for such fate-mapping experiments by 
infection of an organism with the HRP gene in a 
siutable vector such as a retroviral vector or 
transposon. 

7) The advantage of a synthetic gene for HRP allows for 
the production of HRP genes modified to encode a 
protein carrying small additional sequences, such as N- 
or C- terminal extensions. These will be of great 
application in simplifying the purification of the HRP 
and/or increasing the ease and enhancing the 
specificity with which it can be cross-linked to other 
proteins of interest or otherwise derivatised. For 
example , a C-terminal extension of six to eight Arg 
residues could be used to simplify purification by 
analogy with the technique of Sassenfeld et al. 
Bio/technology 2 76 (1984). Alternatively, a tail of 
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Lys residues would provide an accessible and sensitive 
site for reaction with bifunctional cross-linking 
reagents such as glutaraldehyde . 

Preferred embodiments and examples of the invention 
will now be described. In the following description, 
reference is made to a number of drawings, in which: 



Figure 1 



shows the amino acid sequence of 
horseradish peroxidase C; 



Figure 2 



shows the sequence of the horseradish 
peroxidase synthetic gene; a summary of 
useful restriction sites; and a 
sequence of front and back halves of the 
gene that were initially cloned; 



Figure 3 



shows a sequence of synthetic 
horseradish peroxidase gene divided into 
oligonucleotides ; 



Figure 4 



shows a summary of assembly procedure 
used; 



Figure 5 



shows the structure of the HRP E. coli 
expression plasmid pSD18 ; 



Figure 6 



shows a synthetic HRP gene modified for 
efficient expression in mammalian cells; 
and 



Figure 7 



shows the structure of the HRP mammalian 
expression plasmid pCP21. 
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Example 1 

The gene was designed to be synthesised and cloned, in 
this example, in two halves with a final sub-cloning 
step to yield the full length gene. The sequence of 
the two halves of the gene together with that of the 
final product are depicted in Figure 2. The final 
synthetic gene encodes the entire mature horseradish 
peroxidase protein together with the required initiator 
methionine residue but lacks the leader sequence that 
is assumed to be present in the natural gene. It is 
envisaged that the leader sequence appropriate to the 
expression system of choice would be added to the 
synthetic gene as required or ommitted to allow for 
intracellular expression of the gene. 

The desired gene sequence was divided into a front half 
and a back half of 501 and 474 bp respectively. Both 
halves were designed with a common Xho l site to allow 
for the complete gene to be assembled with a simple 
cloning step. The front and back halves of the gene 
were divided into 24 and 22 oligodeoxyribonucleotides 
(oligomers) respectively as depicted in Figure 3. The 
division was such as to provide 7 base cohesive ends 
after annealing complementary pairs of oligomers. The 
end points of the oligomers were chosen to minimise the 
potential for inappropriate ligation of oligomers at 
the assembly stage. 

The oligomers were synthesised by automated solid phase 
phosphoramidite chemistry. Following de-blocking and 
removal from the controlled pore glass support the 
oligomers were purified on denaturing polyacrylamide 
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gels f further purified by ethanol precipitation and 
finally dissolved in water prior to estimation of their 
concentration - 

All the oligomers with the exception of the 5' terminal 
oligomers BB279 and BB3 02 for the front half and BB303 
and BB324 for the back half were then kinased to 
provide them with a 5' phosphate as required for the 
ligation step. Complementary oligomers were then 
annealed and the oligomers ligated together by T4 DNA 
ligase as depicted in Figure 4. The ligation products 
were separated on a 2% low gelling temperature (LGT) 
gel and the bands corresponding to the front and back 
halves of the horseradish peroxidase gene were cut out 
and extracted from the gel. The purified fragments 
were then ligated separately to EcoRI/Hindlll cut DNA 
of the plasmid vector pUC18. The ligated products were 
transformed into HW87 and plated on L-agar plates 
containing 100 meg ml -1 ampicillin. Colonies 
containing potential clones were then grown up in L- 
broth containing ampicillin at 100 meg ml -1 and plasmid 
DNA isolated. Positive clones were identified by 
direct dideoxy sequence analysis of the plasmid DNA 
using the 17 base universal primer, a reverse 
sequencing primer complementary to the opposite strand 
on the other side of the polylinker and some of the 
oligomers employed in the assembly of the gene that 
served as internal primers. One front half and one 
back half clone were subsequently re-sequenced on both 
strands to confirm that no mutations were present. The 
complete gene was then assembled by isolating the 4 66 
bp Xho I- Eco RI fragment from the back half calone that 
contained the 3' end of the gene and ligating it to a 
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front half clone that had also been digested with EcoRI 
and Xho l . The identity of the final construct was 
confirmed by restriction analysis and subsequent 
complete resequencing. 

All the techniques of genetic manipulation used in the 
manufacture of this gene are well known to those 
skilled in the art of genetic engineering. A 
description of most of the techniques can be found in 
one of the following laboratory manuals: Molecular 
Cloning by T. Maniatis, E.F. Fritsch and J. Sambrook 
published by Cold Spring Harbor Laboratory , Box 100, 
New York, USA, or Basic Methods in Molecular Biology by 
L.G. Davis, M.D. Dibner and J.F. Battey published by 
Elsevier Science publishing Co. Inc. New York, USA. 

Additional and modified methodologies are detailed 
below. 

1) Oligonucleotide synthesis 

The oligonucleotides were synthesised by automated 
pho sphor amid ite chemistry using cyanoethyl 
phosphoramidtes . The methodology is now widely used 
and has been described (Beaucage, S.L. and Caruthers, 
M.H. Tetrahedron Letters . 24, 245 (1981)). 

2) Purification of Oligonucleotides 

The oligonucleotides were de-protected and removed from 
the CPG support by incubation in concentrated NH3 , 
Typically, 50 mg of CPG carrying 1 micromole of 
oligonucleotide was de-protected by incubation for 5 hr 



WO 89/03424 



PCT/GB88/00833 



15 

at 70° in 600 racl of concentrated NH 3 . The supernatant 
was transferred to a fresh tube and the oligomer 
precipitated with 3 volumes of ethanol. Following 
centrifugation the pellet was dried and resuspended in 
1 ml of water. The concentration of crude oligomer was 
then determined by measuring the absorbance at 260 run. 

For gel purification 10 absorbance units of the crude 
oligonucleotide were dried down and resuspended in 15 
mcl of marker dye (90% de-ionised formamide, lOmM tris, 
10 mM borate, ImM EDTA, 0.1% bromophenol blue). The 
samples were heated at 90° for 1 minute and then loaded 
onto a 1.2 mm thick denaturing polyacrylamide gel with 
1.6 mm wide slots. The gel was prepared from a stock 
of 15% acrylamide, 0.6% bisacrylamide and 7M urea in 1 
X TBE and was polymerised with 0.1.% ammonium 
persulphate and 0.025% TEMED. The gel was pre-mn for 1 
hr. The samples were run at 1500 V for 4-5 hr. The 
bands were visualised by UV shadowing and those 
corresponding to the full length product cut out and 
transferred to micro-testubes . The oligomers were 
eluted from the gel slice by soaking in AGEB (0.5 M 
ammonium acetate, 0.01 M magnesium acetate and 0.1 % 
SDS) overnight. The AGEB buffer was then transferred 
to fresh tubes and the oligomer precipitated with three 
volumes of ethanol at -70° for 15 min. The precipitate 
was collected by centrifugation in an Eppendorf 
microfuge for 10 min, the pellet washed in 80 % 
ethanol, the purified oligomer dried, redissolved in 1 
ml of water and finally filtered through a 0.45 micron 
micro-filter. The concentration of purified product 
was measured by determining its absorbance at 260 nm. 
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3) Kinasing of oligomers 

250 pmole of oligomer was dried down and resuspended in 
20 mcl kinase buffer (70 mM Tris pH 7.6, 10 mM MgCl2 / 1 
mM ATP, 0.2 mM spermidine, 0.5 mM dithiothreitol). 10 
u of T4 polynucleotide kinase was added and the mixture 
incubated at 3 7° for 3 0 min. The kinase was then 
inactivated by haeating at 85° for 15 min. 

4 ) Annealing 

8 mcl of each oligomer was mixed , heated to 90° and 
then slow cooled to room temperature over a period of 
an hour. 

5 ) Ligation 

5 mcl of each annealed pair of oligomers were mixed and 
10 X ligase buffer added to give a final ligase 
reaction mixture (50 mM Tris pH 7.5, 10 mM MgCl 2 , 2 0 mM 
dithiothreitol, 1 mM ATP. T4 DNA ligase was added at a 
rate of 100 u per 50 mcl reaction and ligation carried 
out at 15° for 4 hr. 

6) Agarose gel electrophoresis 

Ligation products were separated using 2% low gelling 
temperature agarose gels in 1 X TBE buffer (0.094 M 
Tris pH8.3, 0.089 M boric acid, 0.25 mM EDTA) 
containing 0.5 meg ml" 1 ethidium bromide. 
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7) Isolation of ligation products 

The band corresponding to the expected horseradish 
peroxidase gene or gene fragment ligation product was 
identified by reference to size markers under long wave 
UV illumination. The band was cut out of the gel and 
the DNA extracted as follows. 

The volume of the gel slice was estimated from its 
weight and then melted by incubation at 65° for 10 min. 
The volume of the slice was then made up to 400 mcl 
with TE (10 mM Tris pH 8.0, 1 mM EDTA) and Na acetate 
added to a final concentration of 0.3 M. 10 meg of 
yeast tRNA was also added as a carrier. The DNA was 
then subjected to three rounds of extraction with equal 
volumes of TE equilibrated phenol followed by three 
extractions with ether that had been saturated with 
water. The DNA was precipitated with 2 volumes of 
ethanol, centrifuged for 10 min in a micro fuge, the 
pellet washed in 70 % ethanol and finally dried down. 
The DNA was taken up in 20 mcl of TE and 2 mcl run on a 
2 % agarose gel to estimate the recovery of DNA. 

8) Cloning of fragments 

For the initial cloning of the two halves of 
horseradish peroxidase 0.5 meg of pUC18 DNA was 
prepared by cleavage with Hindlll and EcoRI as advised 
by the suppliers. The digested DNA was run on an 0.8 % 
LGT gel and the vector band purified as described 
above. For the final assembly step the clone carrying 
the front half of the horseradish peroxidase gene was 
treated similarly using the enzymes Xho l and EcoRI. 
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20 ng of cut vector DNA was then ligated to various 
peroxidase gene DNA ranging from 2 to 20 ng for 4 hr 
using the ligation buffer described above . The 
ligation products were used to transform competent HW87 
as has been described. Ampicillin resistant 
transformants were selected on L-agar plates containing 
100 meg ml - " 1 ampicillin. 

9) Isolation of plasmid DNA 

Plasmid DNA was prepared from the colonies containing 
potential horseradish peroxidase clones essentially as 
described (Ish-Horowicz , D. , Burke, J.F. Nucleic Acids 
Research 9 2989-2998 (1981) . 

10) Dideoxy sequencing 

The protocol used was essentially as has been described 
(Biggin, M.D. , Gibson, T.J. , Hong, G.F. P.N.A.S. 80 
3963-3965 (1983)). The method was modified to allow 
sequencing on plasmid DNA as described (Guo, L-H. , Wu, 
R- Nucleic Acids Research 11 5521-5540 (1983). 

11) Transformation 

Transformation was accomplished using standard 
procedures. The strain used as a recipient in the 
cloning was HW87 which has the following genotype: 

ara D139 ( ara - leu ) de!7697 (lacIPOZY) de!74 galU gal K hsd R 
rps L srl rec A56 

Any other standard cloning recipient such as HB101 
would be adequate. 
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Example 2 

The front end of the synthetic HRP gene prepared in 
Example 1 was modified by the replacement of the 
Hin di II- Hpa l fragment with a synthetic linker carrying 
an Ndel site on the initiator ATG as follows: 

M Q L T . 

AAG CTTCATATG CAGTTAAC C 

Hindlll Nde l Hpa l 

Example 3 

Expression of the Synthetic Horseradish Peroxidase Gene 
in Escherichia coli 

The synthetic HRP gene of Example 2 was cloned into the 
expression vector pGC517 on a Nde l- BamH I fragment to 
give the plasmid pSD18. The host vector pGC517 was 
prepared from the known plasmid pAT153 (Twigg & 
Sherratt Nature 283, 216-218 (1980) ), which is now a 
standard E . coli high expression vector, by the 
incorporation by standard methods of the known tac 
promoter sequence and a termination sequence. pAT153 
is itself a derivative of pBR322. In pGC517 the HRP 
gene is expressed from the powerful and regulatable tac 
promoter. To ensure that expression remained repressed 
in uninduced cultures the plasmid was maintained in E^ 
coli strain W3110 lac l q , which is widely available, in 
which the lac repressor protein is over-produced. 
Figure 5 depicts the structure of pSD18 . 

Strain W3110 lad q -pSD18 was grown in M9 minimal medium 
containing 0.2% glucose and 0.2% casamino acids. At an 
O.D. of 0.2 - 0.3 the culture was induced by the 
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addition of IPTG to a final concentration of 5mM. The 
culture was grown for a further 3 hr with samples 
removed at 30 min intervals. 

Microscopic examination of the induced culture revealed 
the presence of* inclusion bodies, characteristic of the 
accumulation of large amounts of insoluble aggregated 
protein within the cell. In addition, cultures 
expressing HRP at high levels acquired a pink 
colouration, perhaps related to the overexpression of a 
haem protein. SDS/PAGE analysis subsequently revealed 
the presence of a large amount of a 33 kD protein, 
estimated at 10-20% of total cell protein in induced 
but not uninduced cultures. Western blot analysis 
confirmed that this protein was HRP. 

Standard methods for inclusion body isolatipn could be 
applied to obtain a substantial purification of the 
denatured HRP as insoluble aggregates. This material 
was then dissolved in 6 M guanidine HC1 prior to 
renaturation . For renaturation, the dissolved HRP was 
dialysed against 8 M urea, 50mM Tris HC1, lOOmM NaCl 
for 24 hr. Ca 2+ was then added (as CaCl 2 ) to 1 mM and 
the sample incubated for 2 hr at room temperature. 
This procedure resulted in the recovery of about 0.125% 
of the expected HRP activity by the standard pyrogallol 
colorimetric assay and based on the protein 
concentration and estimated purity of the preparation 
(see Table 1) . 
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Table 1 - Renaturation of HRP Expressed in E. coli 



s 

a 
m 

P 
1 
e 


Conditions 


Rate of reaction 
(maximum) 

>TT Am4 n 

AiJ/mxn 


Amount of 
recombinant 

TTOT"> 

xiKf L. 

assayed 
mog 


Activity 
AU/min meg 

wun TJDT3 

rec. xiKir 


Activity 
(% of max. 

dtuLVlty vJJ_ 

commp.rtzial 
HRP) 


i 

X 


1st 

dialysis 


n m ATT/n ft min 


25 ITK^rj 


5xl0~^ ATT An in 

**JX* A V/ fM/^ 111 till 

meg 


0.007% 


2 


after 
1st 

dialysis 


0.015 AD/1.1 min 


5.77 meg 


0.0024 AU/min 
meg 


0.034% 


3 


sample 2 
incubated 
with 1 ieM 
Ca 2+ for 2h 


0.01 AD/1.5 min 


0.76 meg 


0.029 AU/min 
mog 


0.125% 



Control samples prepared from similar cultures carrying 
the expression plasmid without the HRP gene gave 
backgrounds about 1000 fold less than this. The assay 
mixture contained freshly prepared pyrogallol and 
■peroxide in the following concentrations: llmM K 
phosphte, pH 6.0, 8mM H 2 0 2 , 0.55% w/v pyrogallol in 
H 2 0. The HRP was added and the increase in adsorption 
at 420nm was followed. 

Thus the synthetic HRP gene is capable of high level 
expression in E. col i and is capable of directing the 
synthesis of active product. 
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Example 4 

The synthetic HRP gene of Example 2 was modified as 
follows to allow for its efficient expression in 
mammalian cells: 

(a) The 3' end of the gene was extended from the Pst 1 
site to include the C-terminal extension reported by 
Fujiyama et al. Eur. J. Biochem, 173, 681-687 (1988). 

(b) The 5' end of the gene was modified by the 
addition of a Hindlll/ Hpa l linker which encoded a 
signal sequence based on an immunoglobulin 
s ignalpep t ide • 

The modified HRP gene is depicted in Figure 6, and will 
be referred to as HPRX. 

Example 5 

Expression of the Synthetic Horesradish Peroxidase Gene 
in Mammalian Cells 

The HRPX gene of Example 4 was inserted into the 
mammlian cell expression vector pCPHll to give pCP21, 
in which the HRP gene is expressed from the HCMV (Human 
Cytomegalovirus) early promoter, see Figure 7. The 
plasmid pCPHll is based on pUC18, which is widely 
avaialble and from which it can be prepared by standard 
methods , using the information in Figure 7. 

The HRP expression plasmid pCP21 was transfected into 
COS cells using the standard technique of calcium 
phosphate precipitation (2 0mcg DNA transfected per 10 6 
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cells) . HRP activity was assayed in cell culture 
medium, 48-72h post trans feet ion using tetra -methyl 
benzidine substrate (TMB) , a standard HRP reagent. No 
HRP activity was detectable in control constructs which 
did not contain a signal sequence and/or the 3 9 
extension. In contrast, HRP activity was clearly 
detectable in cells transfected with pCP21 (up to lOx 
greater than in controls) . The results are shown in 
Table 2. 



Table 2 - HRP Expression in COS Cells 



Vol. Extract, 
(mcl) 



pCP21 



O.D.450 

Plasmid 
PCP22 . PCPU 



PCP12 



100 
50 
25 
10 
5 
1 



.004 

.033 

.107* 

.084* 

.064* 

.028 



.022 
.012 
.010 
.011 
.015 
.008 



.008 
.010 
.003 
. 007 
.012 
.007 



. 003 
.013 
.016 
.017 
.011 
.007 



KEY 

pCP21 HRP with N and C terminal signals , correct 

orientation. 

pCP22 HRP with N and C terminal signals, wrong 

orientation. 

pCPll HRP with no signal sequences, correct 

orientation . 

pCP12 HRP with no signal sequences, wrong 

orientation. 
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All results are the mean of duplicate samples, 
* = significant level of activity. 

HRP Assay 

For assaying cell extracts, a substrate mix was 
prepared as follows: 

TMB (3 ,3', 5 ,5' tetramethyl benzidine (Sigma)) was 
dissolved to 10 mg/ml in DMSO and 100 mcl of this 
solution added to 100 ml of assay buffer (0.1M NaAc in 
citric acid, pH6.0) along with 100 mcl H 2 o 2 . 

A cell extract was prepared by collecting the cells by 
centrif ugation followed by freeze thawing or 
sonication. The medium, cell lysates and standards 
were aliquoted in 96 well microtitre plates as 'follows: 

Sample 100 50 25 10 5 1 mcl 
Assay 0 50 75 90 95 99 mcl 

Buffer 

Blank samples were set up using 100 mcl of assay buffer 
alone. 100 mcl of TMB/H 2 0 2 mix was added to the 
samples of incubated at RT for 30 mins to 1 hour. The 
reaction was stopped by the addition of 50 mcl of 2.5M 
H 2 S0 4 and the colour change read at 450 nm on a plate 
reader. 

Commercially available HRP was used as a standard 
diluted by a factor of 10~ 6 . 
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CLAIMS 

1. DNA coding for horseradish peroxidase and having 
restriction sites for the following enzymes: 

Hoa l, Sad , Ssp l, Nhe l, Nsil, Clal, Pvul, SphI, 
BspMI, £spl, RsrII, Sail, Bglll, Ball. PvuII, 
Xho l. BspM II, BstEII, Sna BI . PflMI, Apa LI . Spe l, 
Seal, Xba l. Stu I , Bel l, Bst XI , Nco l, Kpn l and PstI 

2. DNA including the following sequence: 



CAG 


TTA 


ACC 


CCT 


ACA 


TTC 


TAC 


GAC 


AAT 


AGC 


TGT 


CCC 


AAC 


GTG 


TCC 


AAC 


ATC 


GTT 


CGC 


GAC 


ACA 


ATC 


GTC 


AAC 


GAG 


CTC 


AGA 


TCC 


GAT 


CCC 


AGG 


ATC 


GCT 


GCT 


TCA 


ATA 


TTA 


CGT 


CTG 


CAC 


TTC 


CAT 


GAC 


TGC 


TTC 


GTG 


AAT 


GGT 


TGC 


GAC 


GCT 


AGC 


ATA 


TTA 


CTG 


GAC 


AAC 


ACC 


ACC 


AGT 


TTC 


CGC 


ACT 


GAA 


AAG 


GAT 


GCA 


TTC 


GGG 


AAC 


GCT 


AAC 


AGC 


GCC 


AGG 


GGC 


TTT 


CCA 


GTG 


ATC 


GAT 


CGC 


ATG 


AAG 


GCT 


GCC 


GTT 


GAG 


TCA 


GCA 


TGC 


CCA 


CGA 


ACA 


GTC 


AGT 


TGT 


GCA 


GAC 


CTG 


CTG 


ACT 


ATA 


GCT 


GCG 


CAA 


CAG 


AGC 


GTG 


ACT 


CTT 


GCA 


GGC 


GGA 


CCG 


TCC 


TGG 


AGA 


GTG 


CCG 


CTC 


GGT 


CGA 


CGT 


GAC 


TCC 


CTA 


CAG 


GCA 


TTC 


CTA 


GAT 


CTG 


GCC 


AAC 


GCC 


AAC 


TTG 


CCT 


GCT 


CCA 


TTC 


TTC 


ACC 


CTG 


CCC 


CAG 


CTG 


AAG 


GAT 


AGC 


TTT 


AGA 


AAC 


GTG 


GGT 


CTG 


AAT 


CGC 


TCG 


AGT 


GAC 


CTT 


GTG 


GCT 


CTG 


TCC 


GGA 


GGA 


CAC 


ACA 


TTT 


GGA 


AAG 


AAC 


CAG 


TGT 


AGG 


TTC 


ATC 


ATG 


GAT 


AGG 


CTC 


TAC 


AAT 


TTC 


AGC 


AAC 


ACT 


GGG 


TTA 


CCT 


GAC 


CCC 


ACG 


CTG 


AAC 


ACT 


ACG 


TAT 


CTC 


CAG 


ACA 


CTG 


AGA 


GGC 


TTG 


TGC 


CCA 


CTG 


AAT 


GGC 


AAC 


CTC 


AGT 


GCA 


CTA 


GTG 


GAC 


TTT 


GAT 


CTG 


CGG 


ACC 


CCA 


ACC 


ATC 


TTC 


GAT 


AAC 


AAG 


TAC 


TAT 


GTG 


AAT 


CTA 


GAG 


GAG 


CAG 


AAA 


GGC 


CTG 


ATA 


CAG 


AGT 


GAT 


CAA 


GAA 


CTG 


TTT 


AGC 
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AGT CCA AAC GCC ACT GAC ACC ATC CCA CTG GTG AGA 
AGT TTT GCT AAC TCT ACT CAA ACC TTC TTT AAC GCC 
TTC GTG GAA GCC ATG GAC CGT ATG GGT AAC ATT ACC 
CCT CTG ACG GGT ACC CAA GGC CAG ATT CGT CTG AAC 
TGC AGA GTG GTC AAC AGC AAC TCT 

3. A genetic construct comprising DNA as claimed in 
claim 1 or 2, or a fragment thereof. 

4. A construct as claimed in any one of claims 3 to 
8, which is a vector. 

5. A process for the preparation of DNA as claimed in 
claim 1 or 2 or a genetic construct in accordance with 
claim 3, the process comprising coupling successive 
nucleotides and/or ligating appropriate oligomers. 

6. DNA substantially as herein described with 
reference to Figure 2a. 

7. A construct as claimed in claim 3 comprising all 
or a fragment of the sequence defined in claim 1 or 2 
fused to any other sequence of DNA so as to result in a 
sequence capable of encoding a hybrid protein 
possessing peroxidase activity. 

8. A process for the production of monodisperse 
horseradish peroxidase C comprising the expression of 
at least part of a genetic construct as claimed in 
claim 3. 

9 . A construct which is a genetic fusion between a 
gene encoding horseradish peroxidase and a gene 
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encoding streptavidin or avidin such that the encoded 
fusion protein possesses both biotin binding and 
peroxidase activity. 

10. A genetic fusion between a gene encoding 
horseradish peroxidase and * a gene encoding an 
immunoglobulin-derived antigen binding function such 
that the fusion protein possesses both antigen binding 
and horseradish peroxidase activity. 

11. A genetic fusion as claimed in claim 10, wherein 
the antigen binding function is an immunoglobulin heavy 
chain or light chain or fragments thereof or an 
engineered monomeric antigenic recognition site. 

12. A construct as claimed in claim 4, which is a 
vector comprising the gene for horseradish peroxidase C 
that enables the production of fusions between 
horseradish peroxidase and any other protein of 
interest. 

13. A construct as claimed in claim 4, which is an 
expression vector that provides for the co-expression 
of the gene for horseradish peroxidase and another gene 
of interest either as a single fusion product , as a 
single polycistronic message or as two separate but 
linked transcriptional units. 

14. A gene for horseradish peroxidase containing a 
mutation (either missense, nonsense, deletion, 
insertion, duplication or other re-arrangement) that 
destroys or impairs the activity of the encoded 
horseradish peroxidase protein. 
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15. A genetic construct including all or a fragment of 
a mutant horseradish peroxidase gene as claimed in 
claim 14. 

16. The use of a defective horseradish peroxidase 
genetic construct as claimed in claim 15 where the 
genetic construct is employed in mammalian cells. 

17. The use of a defective horseradish peroxidase 
genetic construct as claimed in claim 15 where the 
genetic construct is employed in transgenic animals. 

18. Any novel feature or combination of features 
disclosed herein (whether or not otherwise claimed) . 



WO 89/03424 PCT/GB88/00833 

1/15 

FIGURE 1 

AMINO ACID SEQUENCE OF HORSERADISH PEROXIDASE (INCLUDING 
INITIATOR MEHHONINE) • 



(M) QLTPTFYDNSCPNVSNIVR 

10 20 

DTIVNELRSDPRIAASILRL 

30 40 

HFHDCFVNGCDASILLDNTT 

50 60 

SFRTEKDAFGNANSARGFPV 

70 80 

IDRMKAAVESACPRTVSCAD 

90 100 

LLTIAAQQSVTLAGGPSWRV 

HO 120 

PLGRRDSLQAFLDLANANLP 

130 140 

APFFTL PQLKDS FRNVGLNR 

150 16a 

SSDLVALSGGHTFGKNQCRF 

170 180 

IMDRLYNFSNTGLPDPTLNT 

190 200 

TYLQTLRGLCPLNGNLSALV 

210 220 

DFDLRTPTIFDNKYYVNLEE 

230 240 

QKGLIQSDQELFSSPNATDT 

250 260 

I PLVRSFANSTQTFFNAFVE 

270 280 

AMDRMGNITPLTGTQGQIRL 

290 300 



NCRVVNSNS 

309 
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SEQUENCE OF SYNTHETIC HORSERADISH PEROXIDASE GENE 



MQLTPTFYDNSCPNVSN 
AAGCTTAACCATCC^ 
HixiDHI Hpal 

10 20 30 40 50 60 




70 80 90 100 110 120 



LRLHFHDCFVNGCDASILLD 
A!ITAiOSKI[GC^^ 

Nhel 

TAATCCfcGAOGrt^^ 

130 140 150 160 170 180 

NTTS FRTEKDAFGNANSARG 
CAACACa^XAGTITC^ 

Nsil 

190 200 210 220 230 240 



FPVIDRMKAAVESACPRTVS 
CTITCX^AGrcATCGAT^ 

Clal/Pvul SphI 
GAAAGGTCACTAGCEA^ 

250 260 270 280 290 300 



CADLLTIAAQQSVTLAGGPS 
TICTGGAGAOOXXnGACIJfl^ 

BspMI Fspl RsrH 

AACACGTTCnX3GAOGAC3EA^ 

310 320 330 340 350 360 

WRVPLGRRDSLQAFLDLANA 
CTGCAGAGTCCXXXTOGCnOGA 

Sail BglTE/Ball 

G&OJIXnCAOGGC^^ 

370 380 390 400 410 420 

NLPAPFFTLPQLKDSFRNVG 
CAACTIGCCTGC^ 

PvuH 

430 440 450 460 470 480 



LNRSSDLVALSGGHTFGKNQ 
TCTGAAIOGCnX^^ 

Xhol BspMH 

490 500 510 520 530 540 
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CRFIMDRLYNFSNTGLPDP T 
GTCIAGCTTCATCATCGA^ 

BstEH 

550 560 570 580 590 600 

LNTTYLQTLRGLCPLNGNLS 

SnaBI PflMI 

610 620 630 640 650 660 

ALVDFDLRTPTIFDNKYYVN 

ApaU/Spel Seal 

670 680 690 700 710 720 

LEEQKGLIQSDQE L F S S P N A 
TCISGAGGAGCAGAAASGCC^^ 
Xbal StuI Bell 

730 740 750 760 770 780 

TDTIPLVRSFANSTQTFFNA 

BstXT 

790 800 810 820 830 840 

FVEAMDRMGNITPLTGTQGQ 
CTimiGGAAGCC^^ 

Ncol F£nl 
GAA3CACCTraX3TACC^^ 

850 860 870 880 890 900 

IRLNCRVVNSNS** 
GATIXCTCXGAAC^^ 

PstI BanfflL EcoKI 

910 920 930 940 950 % 
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FIGURE 2b 

SUMMARY OF USEFUL RESTRICTION SUES. 



ENZYME 


SEQUENCE 


POSITION 


HinDHI 


AAGCIT 


1 


Hpal 


CTTAAC 


16 


Sad 


GftGCTC 


86 


Sspl 


AM30T 


118 


Nhel 


GCTftGC 


164 


Nsil 


MGCAT 


210 


CLal 


ATOGAX 


251 


Pvul 


OGATCG 


253 


SphI 


GCAIGC 


281 


BspMI 


ACCTGC 


309 


Fspl 


TGOGCA 


325 


RsrII 


OGGACOG 


352 


Sail 


GTOGAC 


378 


BglH 


AGATCT 


406 


Ball 


TCGOCA 


411 


PvuII 


CAGCIG 


452 


Xhol 


CIOGAG 


490 


BspMZE 


TCOQGA 


512 


BstEU 


GCTTAOC 


585 


SnaBI 


TAOGIA 


610 


PflMI 


CCACIGAA3X3G 


641 


ApaLI 


CTGCAC 


660 


Spel 


ACEACT 


664 


Seal 


ACTACT 


708 


Xbal 


TCIAGA 


721 


StuI 


AGGOCT 


736 


Bell 


TGATCA 


751 


BstXI 


CCATCCCACTGG 


789 


Ncol 


CCAIGG 


852 


K£nl 


GCTAOC 


887 


PstI 


CTGCAG 


913 


BanKC 


GGATCC 


944 


EcoRI 


GAATTC 


950 



WO 89/03424 PCT/GB88/00833 

5/15 

FIGURE 2C 

SEQUENCE OF SYNTHETIC HORSERADISH PEROXIDASE GENE 5' END HALF 

MQLTPTFYDNSCPNVSN 
AAGCTIAACCAiroC^ 
HinDIH HpaT 
Ti03AATIX3GiaOGrC3^^ 

10 20 30 40 50 60 

IVRDTIVNELRSDPRIAASI 
GA3XXnTO30G3^CACA^ 

Nrul Sad Sspl 

70 80 90 100 110 120 

LRLHFHDCFVNGCDASILLD 

ATEACG?ICir^ 

Nhel 

130 140 150 160 170 180 

NTTSFRTEKDAFGNANSARG 
CAACACXACCACnTira^^ 

Nsil 

CTTCIGGTQGTCAAAGGOGIGAC^^ 

190 200 210 220 230 240 

FPVI DRMKAAVESACPRTVS 

Clal/Pvul SphI 
GAAAGGTCACTAGCraGOG^ 

250 260 270 280 290 300 

CADLLTIAAQQSVTLAGGPS 
TICTGCAGACCIGCTC^ 

BspMI FSpI RsrH 



310 320 330 340 350 360 

WRVPLGRRDSLQAFLDLANA 

CIGGAGACTGOOGC3XI3GTOGAOCT 

Sail Bglll/Ball 
GACCICTCAOGGO^^ 

370 380 390 400 410 420 

NLPAPFFTLPQLKD S F R N V G 

Pvun 

GITCAAO^OGAajrAAGAAG^ 

430 440 450 460 470 480 



L N R S 
TCTTCAAIOGCIOS^ 

Xhol EcbRI 
AGACTEAGOGAGCTOCTIAA^ 
490 
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SEQUENCE OF SYNTHETIC HORSERADISH PEROXIDASE GENE 3' END HALF, 



SSDLVALSGGHTFGKNQ 
AAGCITAACTaSAGIGACCT^ 
Hinnm Xhol BspMH 

490 500 510 520 530 540 



CRFIMDRLYNFSNTGLPDPT 

BstEH 

CACATOCAAGraCTAOCTATO^ 

550 560 570 580 590 600 

LNTTYLQTLRG L C P L N G N L S 
GCIGAACACTAOCTATC 

SnaBI PflMI 
CGACITGTCMG^ 

610 620 630 640 650 660 



ALVDFDLRTPTIFDNKYYVN 
TGCACEAGTCGAC^^ 
ApaLI/Spel Seal 
ACGTCAICACXTCA^ 

670 680 690 700 710 720 



LEEQKGLIQSDQELFSSPNA 
TCIAGAGGAGCAGAAAGGCIJEGAIA 
Xbal StuI Bell 

730 740 750 760 770 780 



TDTIPLVRSFANSTQTFFNA 
CACIGACAO^mXCACra^ 
BstXI 

GTCACICTtSGTAGGGTCAOCAC^ 

790 800 810 820 830 840 



FVEAMDRMGNITPLTGTQGQ 
CTIOJDGGAAGOraiG^^ 

Ncol Kpnl 
GAAGCACCrrOGGTAOC^^ 

850 860 870 880 890 900 

IRLNCRVVNSNS** 

GATTOGTTCIGAAC^ 

PstI Banm EcoRI 

CTAAGC3^GACITGAOGnDCTCA 

910 920 930 940 950 
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DESIGN OF OUGCMERS FOR SYNTHETIC HORSERADISH PEROXIDASE GENE 5 'HALF. 



BB279 

5' ^OZTIPACCNIX^ toocs^aoctstccaa 

t i i i i 1 1 f i i i i i i I i i i i i i i i i i i i i i i i i f i i i f i i i ilium 

3' ATTOGlEAOSTCAAITCGGGAIG^ CAGAGGTT 
10 20 30 40 50 60 

BB280 

BB281 BB283 
CATCTIITCGOCS^CA^ AGCICAGAIXXGATOCCAGGATOGC^ 

llllllllllllllllllllllllll lllllllllllllllllllllllllll 

GTAGCAAGOGCIGICTEAG^ TAGGCTAGGCTOCTAGOGAO 

70 80 90 100 110 120 

BB282 BB284 

HB285 

A1TACGTC TGCACITOCAIGACroC^^ CATATEACTGGA 

iiiiiiii Miiiiiiiiiiiiiiiiiiiiiiiiiiiiiii inn 

TAATCC3GACCTGAA GGmJJX^CGANX^ GACCT 
130 140 150 160 170 180 

BB286 

BB287 BB289 
CAACACCACCaGITia^ ATGCAITCEGGAAOGCIAACAGOGC^^ 

1 1 ii iii mi i mi mil ii in ii i m^m^m^^ 

190 200 210 220 230 240 

BB288 BB290 

BB291 

GAAAGGTCACEAGCT AGOGTACTICCISAaSGCAACTC^^ GTC 
250 260 270 280 290 300 

BB292 

BB293 BB295 
ITCIO^GACCIGC^^ AGOGIGACTC37IGCAGGOGGACXIX3IC 

1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 f 1 1 1 1 1 1 1 1 1 1 JJLUJJUJ^ 

AACACX^TCTQGACGACTGA^ GAGAAQ3TOOGCXTGGCAG 
310 320 330 340 350 360 

BB294 BB296 

BB297 

CIGGAGAGIGCOG CTCGCTTCEACXTI^^ 

IMIIIIMIIII MIIIIIIIIIIIIIMIIIIIIIlllllilMI^ 
Gzcczxnxxj^^ cTCGAcrcAa^G^m^ 

370 380 390 400 410 420 

BB298 

BB299 BB301 
AOGCC^CTrcCCTCC^^ GCTCAAGGAII3^GCTITAGAAACX^ 

IIIIMIIIIIiMMIiiiiiiiiiiiii ii!!!'li ! iJ4i^ 

GMCGGACQbGG^ CEATOGAAATCTITGCACCC 
430 440 450 460 470 480 

BB300 BB302 

TCTGAATOGCTOGAGG 3' 

IMillllllMIII! 

AGACITAGOGAGCKrTr^ 5' 
490 500 
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DESIGN OF OLTGCMERS FOR SYNTHETIC HORSERADISH PEROXIDASE GENE 3' HALF. 



BB303 

' AGCTIAACTOGAGr[GACnr^^ GGAAAGAACCA 

IIIIIIIIIIIIMIIIIIMIIIIIIIIIIIMMMIIMI II II 

490 500 510 520 530 540 

BB304 

BB305 BB307 

CTCTAGKnXZATCAIK^^ TTTCBGC^ACACTGC^^ 

IMIIMIIMIIMIIIIIIIIIIIIIII IIIIIIIIIIIIIIIMIIIIII 

CACATCX^ACTAI^OT TIGTCACCXZAAIGGACTGGGOro 
550 560 570 580 590 600 

BB306 BB308 

BB309 

GCDGAAO^CIACGEA TCTCCAGACACK^GA^ 

Mlllllllllllll IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
OSACTICTGATGGATAGAGGTC TCTGACIXZKX^AACAOS^^ 

610 620 630 640 650 660 

BB310 

BB311 

AGIGCACTAGTCGAC![PIT^ ATAAC1AAGTACTA3CTGAA 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII llllllllllll 

ATCAOILGAAACTAGAOGCCTC^ CAIGATACACTT 
670 680 690 700 710 720 

BB312 

BB313 BB315 
TCTAGAGGAGCAGAAAGGCrrrGA TACAGACTG&TCAAGAACI^^ 

IIHIIIIIIIIIIIIIIIMII llllllllllllllllllllllllllllll 

AGATCIXXTraSIKJOT ACTAGITCTIGACAAATC^^ 

730 740 750 760 770 780 

BB314 BB316 

BB317 

CACTGA CACCATTCCACIX^^ CXOTCTTEAA03C 

llllll IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII MUM 

GGDGACDGTOGTEAG GGTCACX^CTCTIX^^ ATIGOG 
790 800 810 820 830 840 

BB318 

BB319 BB321 
CITa2KG&AGCCA2^ TAaXCTCTGAOGGCTAOX^^ 

IMIIIIIIIIIIIIIIIIIIMIIIIIIIIII IMIIIMMIIIMMIM 

GAAGCaCClTCX22IA^^ GACIGCCCAIQ3GITCOGGT 
850 860 870 880 890 900 

BB320 BB322 
BB323 

gaitostcigaacdgc agagigg^k^o^c^ 3' 
IMMIMMMIMI IMMMMIMMMIIIMIIMM 

CEAAGCAGACITCAajrci^^ ACTTGTOGTIGAGAAIT^^ 5' 
910 920 930 940 950 

BB324 
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SUMMARY OF ASSEMBLE PROCEDURE, 5' HALF. 

a) kinased oligomers annealed in paixs and mixed in two groups 

(A & B) . 

BB279 BB281 BB283 BB285 BB287 BB289 

AGcrr p p p p p 

A p p P p p 



BB280 BB282 BB284 BB286 BB288 BB290 



BB291 BB293 BB295 BB297 BB299 BB301 
p p p p p p G 

p p p p p CTTAA 

BB292 BB294 BB296 BB298 BB300 BB302 



b) oligomers ligabed together in two groups. (BB279 and BB302 not 

kinased to avoid imiltimerisation. 



fragment A 
AGCIT 



-ACHAGCT 



p fragment B 

TCATOGA ■ Xhol — G 

Xhol — CITAA 

c) The ligations were checked for the presence of fragment A & B on 
2% agarose gels then the ligation reactions were mixed and the 
reaction allowed to continue to give the firvO product. 

A3CTT TC&TOGA Xhol— G 

A ACEAGCT Xhol — CCTAA 

d) The HHP gene fragment was isolated on a 2% LCT agarose gel and 
cloned into EcoEI/HinDm cut pU£L8. 

HRP 5' half p 

vwva AGCTT — Xhol — G aattcvwvv 

vwvttoga A Xhol — CJTAA gvww 

P 



HinDIII HRP 5' half EcoRI 

vwvaAGCTT -Xhol — Gaattcwwv 

wwttogaA X hol — CITAAgvww 



v = vector sequence 
p = 5 # phosphates 
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FIGURE 4t> 10 A5 
SUMMARY OF ASSEMBLY EBOCEDUKE, 3' HALF. 

a ) kinased oligomers annealed in pairs and mixed in two grcups (A & 
B) . 

BB303 BB305 BB307 BB309 BB311 BB313 

AGCXT p P P P P 

A p P P P P P 

BB304 BB306 BB308 BB310 BB312 EB314 

BB315 BB317 BB319 BB321 BB323 

p p p P P G 

p p p p CTXAA 

BB316 BB318 BB320 BB322 BB324 

b) oligcaners ligated together in two groups. (BB303 and BB324 not 
kinased to avoid irultijiisrisation . 

fragment C 

AGCTT — Xhol- 



A — Xhol ATCTCTC 

P 



p fragment D 
TACAGAG- 



-CTEAA 



c) The ligations were checked for the presence of fragment C & D on 

2% agarose gels then the ligation reactions were mixed and the 
reaction allowed to continue to give the final product. 



AGCTT — Xhol TACAGAG G 

A — Xhol ATCTCTC CTEAA 

d) The HRP gene fragment was isolated en a 2% IGT agarose gel and 

cloned into EcdRI/HinDIH cut pUCL8 

HRP 3' half p 
wwa AGCTT — Xhol - G aattcwvw 

wwttcga A — Xhol — CTEAA gvww 

P 

HinDHI HRP 3' half EccRI 

wwaAGCIT — Xhol — Gaat tcvww 

wwttogaA— Xhol CTrAAgvww 



v = vector sequence 
p = 5' phoshates 
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FIGURE 4c 11/15 
SUMMARY OF FINAL ASSEMBLY PROCEDURE. 



a) 5' and 3' clones of HRP cloned in pOCLS were digested with Xhol 

and EcciRI. Relevant fragments fran each digest were isolated 
from a 0.8% IGT agarose gel. 

HinPTTT HRP 5' HAUF Xhol EocRI 

wvwAAGCTT CTOGAG GAAITCvww 

wwvlTOGA A ■ GAGCTC CTTAAGvww 



HinDHI Xhol HRP 3' HALF EcoRI 
wvwAAGCTT— -CTOGAG G &ATTCvww 
WVWTTOGAA GAGCTC CTTAAGvvvvv 



b) XhoI/BooRI fragment carrying 3' half of HRP ligated into 

XhoI/EcbRI cut HRP 5' half clone. 



HRP 3' HALF 
TOGAG 5 



-CTTAA. 



HRP 5' HALF 

wwvAAGCrr C AASTCvww 

wwvTTOGA A GAGCT Gwvw 



c) Completed gene cloned in pOC18. 



HRP 5' HALF HRP 3' HAIF 

wvwAAGCTT CTOGftS GAAITCwvw 

wwvlTOGAA GAGCTC CTEAAGvww 



v = vector sequence 
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Figure 6^ Synthetic HRP Gene Modified for Expression in Mammalian 
Cells. 



Mkcswvifflmavvtgv 
AAGCTTCCACCA TGAAGTGCTCCTGGGTGATCTTCTTCCTGATGGCCGTGG TGACCGGCG 

H3 2U 3TT 4U 5T5 50 

n SOQ LTPTFYDNSCPNVSNIV 
TGAACTCCCAGTTAACCCCTACATTCTACGACAATAGCTGTCCCAACGTGTCCAACATCG 
VOHpa l 80 90 100 110 120 

RDTIVNELRSDPRIAASILR 
TTCGCGACACAATCGTCAACGAGCTCAGATCCGATCCCAGGATCGCTGCTTCAATATTAC 
130 140 150 160 170 180 

LHFHDCFVNGCDAS I LLDNT 
GTCTGCACTTCCATGACTGCTTCGTGAATGGTTGCGACGCTAGCATATTACTGGACAACA 
190 200 210 220 230 240 

TSFRTEKDAFGNANSARGFP 
CCACCAGTTTCCGCACTGAAAAGGATGCATTCGGGAACGCTAACAGCGCCAGGGGCTTTC 
250 260 270 280 290 300 

V I DRM KAAVE SAC P R TVS C A 
CAGTGATCGATCGCATGAAGGCTGCCGTTGAGTCAGCATGCCCACGAACAGTCAGTTGTG 
310 320 330 340 350 360 

DLLTIAAQQSVTLAGGPSWR 
CAGACCTGCTGACTATAGCTGCGCAACAGAGCGTGACTCTTGCAGGCGGACCGTCCTGGA 
370 380 390 400 410 420 

VPLGRRDSLQAFLDLANANL 
GAGTGCCGCTCGGTCGACGTGACTCCCTACAGGCATTCCTAGATCTGGCCAACGCCAACT 
430 440 450 460 470 480 

PAPFFTLPQLKDSFRNVGLN 
TGCCTGCTCCATTCTTCACCCTGCCCCAGCTGAAGGATAGCTTTAGAAACGTGGGTCTGA 
490 500 510 520 530 540 

RSSDLVALSGGHTFGKNQCR 
ATCGCTCGAGTGACCTTGTGGCTCTGTCCGGAGGACACACATTTGGAAAGAACCAGTGTA 
550 560 570 580 590 600 

FIMDRLYNFSNTGLPDPTLN 
GGTTCATCATGGATAGGCTCTACAATTTCAGCAACACTGGGTTACCTGACCCCACGCTGA 
610 620 630 640 650 660 

TTYLQTLRGLCPLNGNLSAL 
ACACTACGTATCTCCAGACACTGAGAGGCTTGTGCCCACTGAATGGCAACCTCAGTGCAC 
670 680 690 700 710 720 
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VDFDLRTPTI FDNKYYVNLE 
TAGTGGACTTTGATCTGCGGACCCCAACCATCTTCGATAACAAGTACTATGTGAATCTAG 
730 740 750 760 770 780 

EQKGLIQSDQELFSSPNATD 
AGGAGCAGAAAGGCCTGATACAGAGTGATCAAGAACTGTTTAGCAGTCCAAACGCCACTG 
790 800 810 820 830 840 

TIPLVRSFANSTQTFFNAFV 
ACACCATCCCACTGGTGAGAAGTTTTGCTAACTCTACTCAAACCTTCTTTAACGCCTTCG 
850 860 870 880 890 900 

EAMDRMGNITPLTGTQGQIR 
TGGAAGCCATGGACCGTATGGGTAACATTACCCCTCTGACGGGTACCCAAGGCCAGATTC 
910 920 930 940 950 960 

LNCRVVNSNSllhdmvevvd 
GTCTGAACTG CAGAGTGGTCAACAGCAACTCTCTACTCCATGATATGGTGGAGGTCGTTG 

Pit! 9T5 3T5T575 IT5T5" 

f v s s m * - * 
ACTTTGTTAGCTCTATGTAATAAGGATCCGAATTC 

1030 T04T5 ecori 

KEY 



Underlined sequences indicate linkers used to adapt synthetic gene. 
Lower case residues indicate N and C terminal pre & pro sequences* 
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Figure 7^ The HRP Expression Plasmid PCP21. 
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